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5 

Field of the Invention 

The present invention relates to the field of computer system, and more particularly to a 
branch predictor using branch prediction accuracy history and efficient processing techniques for 
instruction streams which include conditional program flow instructions, such as branch 
10 instructions. 

Background of the Invention 
m Many microprocessors employ a technique known as hardware pipelining to increase 

^ instruction throughput by processing several instructions through different phases of execution 

15 concurrently. To maximize instruction execution efficiency, it is desirable to keep the instruction 
In execution pipeline full (with an instruction being processed in each pipeline stage) as often as 
possible such that the pipeline produces useful output every clock cycle. However, whenever 
there has been a transfer of program flow control to another section of software code and 
ftp instructions have been speculatively fetched and processed and it is determined that these 
S 20 instructions should not have been executed, the output from the pipeline is not useful 
u Exceptions and program flow control instructions such as branch instructions provide 

examples of how the program flow control can be changed. Branch instructions, which may be 
conditional or unconditional and may transfer program flow control to a preceding or subsequent 
code section, are used for frequently encountered situations where a change in program flow 
25 control is desired. 

A conditional branch instruction determines instruction flow based on the resolution of a 
specified condition. If A>B then branch to instruction X is an example of a conditional branch 
instruction. In this case, if A>B, program flow control branches to a code section beginning with 
instruction X, also referred to as the target code section. If A is not greater than B, the 
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instructions sequentially following the branch instruction in the program flow, referred to as the 
sequential code section, are executed. In executing such conditional branch instruction, it is 
required to check a condition of the branch instruction for determining the next instruction. Thus, 
performance of a microprocessor including a central processing unit (CPU) may be adversely 
5 affected in pipeline procedures of the microprocessor requiring fast instruction fetch. 

To solve the aforementioned problem, many microprocessors adopt a branch predictor (or 
a branch prediction logic), which operates to predict the outcome of a branch instruction before 
identifying a condition check of the branch instruction, based on a predetermined branch 
prediction approach. Thus, instructions are then speculatively fetched from either the target code 
10 section or the sequential code section based on the prediction indicated by the branch predictor. 
Therefore, a pipeline stall can be prevented. However, when a branch prediction is missed, many 
instructions from the incorrect code section may be in various stages of processing in the 
y3 instruction execution pipeline. On encountering such a misprediction, instructions following the 
J mispredicted conditional branch instruction in the pipeline (or multiple pipelines) are flushed, 
ff 15 and instructions from the other correct code section are fetched. Flushing the pipeline creates 
in bubbles or gaps in the pipeline. Several clock cycles may be required before the next useful 
g " instruction completes execution, and before the instruction execution pipeline produces useful 
2 output. Such an incorrect guess causes the pipeline to stall until it is refilled with valid 
fy instructions. This delay is called the mispredicted branch penalty. 

£j 20 To reduce above described misprediction ratio, various kinds of branch predictors are 

used. Among the branch predictors, a two-level branch predictor is likely to become more 
common. A P6 processor of Intel Corporation is the first to use a two-level branch algorithm to 
improve accuracy. This algorithm, first published by Tse-Yu Yeh and Yale Patt, has the potential 
to push accuracy well beyond the 90% level achieved by the best processors today. 
25 Fig. 1 is a schematic diagram for illustrating a structure of a conventional two-level 

branch predictor. For example, the branch predictor is illustrated in Fig. 2 of New Algorithm 
Improves Branch Prediction by Linley Gwennap, March 27, 1995, MOCROPROCESSOR, pp. 
17-21. 

Referring to Fig. 1, the two-level branch predictor is composed of a branch history 
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register (BHR) 10 and a pattern history table (PHT) 20. The branch history register 10 is used for 
recording the actions of the most recent k conditional branches. For example, a 1 stored in the 
branch history register 10 may denote a branch taken , and a 0 stored in the branch history 
register 10 may denote a branch not taken , respectively. The performed k conditional branches 
5 are called a pattern. 

The pattern history table 20 is used for recording a pattern history bit Sc, which is used 
for predicting a conditional branch of a branch instruction to be performed in response to each 
pattern. For example, the two-level branch predictor predicts a conditional branch I(Sc) in 
response to an entry of 10 stored in the pattern history table 20. The entry corresponds with a 
10 pattern 111010 stored in the branch history register 10. According to the predicted conditional 
branch I(Sc), the next instruction to the branch instruction is fetched. Referring to the Gwennap 
paper referenced above, a predicted conditional branch I(Sc) is determined by a most significant 
v3 bit (MSB) of a pattern history bit Sc stored in the pattern history table 20. 
y3 For example, on the assumption that a real conditional branch of the branch instruction is 

2 1 5 Rc, if a predicted conditional branch I(Sc) is different from the real conditional branch Rc, this 
F: case is called a prediction miss. In this case, execution of instructions following the mispredicted 
s conditional branch I(Sc) are withdrawn. 

S According to the real conditional branch Rc, both data of the branch history register 10 

2 and the pattern history bit Sc stored in the pattern history table 20 are changed. This process is 
Q 20 described as follows. When a least significant bit (LSB) corresponding to the real conditional 

branch Rc of the branch instruction is stored to the branch history register 10, the remaining bits 
are shifted to the left. At this time, the pattern history bit Sc stored in the pattern history table 20 
is updated in response to the real conditional branch Rc. For example, if the real conditional 
branch Rc is 1 denoting predict taken , the pattern history bit Sc is increased by 1, and if the 
25 real conditional branch Rc is 0 denoting predict not taken , the pattern history bit Sc is 

decreased by 1 . The pattern history bit Sc can be composed of an up/down saturating counter as 
shown in A Study of Branch Prediction Strategies , by J. Smith, May 1981, pp. 135-148. The 
saturating counter maintains a minimal value of a pattern history bit Sc when the pattern history 
bit Sc is the minimal value, although the real conditional branch Rc is 0 denoting not taken . In 
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addition, the saturating counter maintains a maximum value of a pattern history bit Sc when the 
pattern history bit Sc is the maximum value, although the real conditional branch Rc is 1 
denoting taken . 

Although branch prediction accuracy may be improved or turned by using different 
5 branch prediction algorithms, mispredictions still occur. By the time a misprediction is identified, 
many instructions from the incorrect code section may be in various stages of processing in the 
instruction execution pipeline. 

An example of a solution to the forgoing performance penalty relevant to mispredicting is 
disclosed in U.S. Pat. No. 5,860,017 to Sharangpani et aL, issued on Jan. 12, 1999, entitled, 
10 "Processor and Method for Speculatively Executing Instructions from Multiple Instruction 
Streams Indicated by a Branch Instruction," which identifies branch instructions, which in 
relationship to other conditional branch instructions, have a relatively high likelihood of being 
J} mispredicted. In this case, once a condition in a branch instruction is identified as being unlikely 
^ to be predicted accurately, the processor fetches and decodes instructions from both target and 
W 15 sequential instruction streams indicated by the conditional branch instruction. However, the 
in method proposed by Sharangpani et aL may cause performance deterioration by a resource 

conflict and may lead to high hardware cost, since the processor fetches both target and 
? f sequential instruction streams. Therefore, there is a need for a branch predictor capable of 
ftf efficient processing of branch instructions by reducing prediction miss with a comparatively 
20 simple circuit configuration and low hardware cost. 

Summary of the Invention 

It is therefore an object of the present invention to provide a branch predictor capable of 
efficiently processing branch instructions by reducing prediction misses with a comparatively 
25 simple circuit configuration and low hardware cost. 

According to an aspect of the present invention, there is provided a branch predictor 
which includes branch prediction means for predicting a conditional branch of a branch 
instruction. A comparator generates a comparison signal by comparing the predicted conditional 
branch from the branch prediction means with a real conditional branch of the branch instruction. 
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An accuracy history table stores an accuracy history of the predicted conditional branch. A first 
state transition logic generates an accuracy history bit to be stored to the accuracy history table in 
response to the comparison signal. A multiplexer outputs either the conditional branch or an 
inverted conditional branch as a final branch prediction outcome, in response to a predicted 
5 accuracy history signal based on the accuracy history bit. 

Brief Description of the Drawings 

The foregoing and other objects, features and advantages of the invention will be apparent 
from the following more particular description of preferred embodiments of the invention, as 
10 illustrated in the accompanying drawings in which like reference characters refer to the same 
parts throughout the different views. The drawings are not necessarily to scale, emphasis instead 
being placed upon illustrating the principles of the invention. 
\§ Fig. 1 is a schematic diagram illustrating a structure of a conventional two-level branch 

jjij predictor. 

^ 15 Fig. 2 is a schematic diagram illustrating a structure of one embodiment of a two-level 

is; ass 

Lf| branch predictor according to the present invention. 
^ Description of the Preferred Embodiment 

f D In accordance with the invention, a branch predictor outputs either a predicted conditional 

r 3 20 branch or an inverted predicted conditional branch as a final branch prediction outcome, in 
^ response to a predicted accuracy history signal based on an accuracy history bit. According to the 
accuracy history bit, it is determined whether the branch prediction outcome of the branch 
predictor is correct. If the predicted conditional branch is correct, the branch predictor outputs the 
predicted conditional branch, and if the predicted conditional branch is not correct, the branch 
25 predictor outputs the inverted predicted conditional branch, in response to the predicted accuracy 
history signal. 

Fig. 2 is a schematic diagram illustrating a structure of one embodiment of a two-level 
branch predictor according to the present invention. Referring to Fig. 2, the two-level branch 
predictor comprises a branch history register 15 for recording actions of the most recent k 
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conditional branches, a pattern history table 25 for recording a pattern history bit Sc used for 
generating a predicted conditional branch I(Sc), and an accuracy history table 60 for recording 
accuracy history of the predicted conditional branch I(Sc). The accuracy history table 60 is 
composed of a memory array. 

A first state transition logic circuit 30 generating a pattern history bit Sc to be stored to 
the pattern history table 25 in response to a real conditional branch Rc is coupled to the pattern 
history table 25. In addition, a second state transition logic circuit 50 generating an accuracy 
history bit Ac to be stored to the accuracy history table 60 is coupled to the accuracy history table 
60. 

Further, the branch predictor according to the present invention comprises a comparator 
40 generating a comparison signal by comparing the predicted conditional branch I(Sc) generated 
by the pattern history bit Sc with the real conditional branch Rc of the branch instruction. The 
comparison signal is inputted to the second state transition logic circuit 50 to generate the 
accuracy history bit Ac. In addition, the branch predictor comprises a multiplexer 70 selecting 
either a predicted conditional branch I(Sc) or an inverted predicted conditional branch as a final 
branch prediction outcome or result. A predicted accuracy history signal I(Ac) based on the 
accuracy history bit Ac is used as a selection signal for the multiplexer 70. Operation of the 
branch predictor is described as follows. 

A predicted conditional branch I(Sc) is generated in response to a pattern history bit Sc 
corresponding to a pattern stored in the branch history register 15. The predicted conditional 
branch I(Sc) is inputted to the comparator 40 to be compared with a real conditional branch Rc. 

The real conditional branch Rc has a 1 or 0 value according to "predict taken" or 
"predict not taken," respectively , and the value stored in the branch history register 15 is updated 
in response to the value of the real conditional branch Rc. According to the updated value of the 
branch history register 15, the pattern history bit Sc is updated. The first state transition logic 
circuit 30 updates the pattern history bit Sc. The first state transition logic circuit 30 is composed 
of an up/down saturating counter. In the first state transition logic circuit 30, the value of the 
pattern history bit Sc is increased by 1 when the real conditional branch Rc is 1 (i.e., taken), and 
the value of the pattern history bit Sc is decreased by 1 when the real conditional branch Rc is 0 
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(i.e., not taken). 

The predicted conditional branch I(Sc) has a value of 1 or 0 in response to a most 
significant bit (MSB) of the pattern history bit Sc. The comparator 40 outputs 1 or 0 as a 
comparison signal to the second state transition logic circuit 50 by comparing the real conditional 
5 branch Rc and the predicted conditional branch I(Sc). For example, if the predicted conditional 
branch I(Sc) is the same as the real conditional branch Rc, the comparator 40 outputs 1 , and if 
the predicted conditional branch I(Sc) is different from the real conditional branch Rc, the 
comparator 40 outputs 0 . 

The second state transition logic circuit 50 receiving the comparison signal determines an 
10 accuracy history bit Ac to be stored to the accuracy history table 60 in response to the comparison 
signal. The second state transition logic circuit 50 is composed of an up/down saturating counter 
O increasing the value of the accuracy history bit Ac by 1 when the predicted conditional branch 
S I(Sc) is the same as the real conditional branch Rc, and decreasing the value of the accuracy 
history bit Ac by 1 when the predicted conditional branch I(Sc) is different from the real 
1 5 conditional branch Rc. The accuracy history bit Ac can be used after learning a branch accuracy 
Cj of the corresponding pattern by monitoring the pattern. 

; = According to the above described method, the accuracy history bit Ac is determined and 

O stored to the accuracy history table 60. According to the accuracy history bit Ac, it can be 
Q determined whether a prediction result of the branch predictor is correct. For example, if a 
S 20 pattern history bit Sc is 011 corresponding to a pattern 11 10 stored in the branch history 
register 15, a predicted accuracy history signal I(Ac) is generated by an MSB of the accuracy 
history bit Ac. The predicted accuracy history signal I(Ac) is used for determining whether the 
predicted conditional branch I(Sc) is correct. For example, if it is considered as the predicted 
conditional branch I(Sc) is correct, the predicted accuracy history signal I(Ac) having a value of 
25 1 is outputted to the multiplexer 70. Thus, the predicted conditional branch I(Sc) is outputted 
from the multiplexer 70 as a final prediction result. In addition, if it is considered as the predicted 
conditional branch I(Sc) is not correct, the predicted accuracy history signal I(Ac) having a value 
of 0 is outputted to the multiplexer 70. Thus, the inverted predicted conditional branch is 
outputted from the multiplexer 70 as a final prediction result. As described above, the predicted 
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accuracy history signal I(Ac) is used as a selection signal of the multiplexer 70 selecting either 
the predicted conditional branch I(Sc) or an inverted predicted conditional branch as a final 
prediction outcome of the branch predictor. 

As described above, the branch predictor according to the present invention outputs either 

5 a predicted conditional branch or an inverted predicted conditional branch as a final branch 
prediction outcome, in response to a predicted accuracy history signal based on an accuracy 
history bit, so that the two-level branch predictor can reduce the misprediction and a 
microprocessor can process branch instructions more efficiently. In this case, the branch 
prediction according to the present invention merely appends the accuracy history table 60 and 

10 multiplexer 70 to the conventional branch predictor. Thus, the branch prediction according to the 
present invention can reduce the misprediction with relatively simple circuitry and low hardware 
cost. 

2 While this invention has been particularly shown and described with references to 

«3 preferred embodiments thereof, it will be understood by those skilled in the art that various 
2, 1 5 changes in form and details may be made therein without departing from the spirit and scope of 
J f ! the invention as defined by the following claims. 
2 What is claimed is: 
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CLAIMS 



1 . A branch predictor comprising: 

branch prediction means for predicting a conditional branch of a branch instruction; 
a comparator for generating a comparison signal by comparing the predicted conditional 
branch from the branch prediction means with a real conditional branch of the branch instruction; 
an accuracy history table for storing an accuracy history of the predicted conditional 

branch; 

a first state transition logic circuit for generating an accuracy history bit to be stored to 
the accuracy history table in response to the comparison signal; and 

a multiplexer for outputting an alternative one of the conditional branch and an inverted 
conditional branch as a final branch prediction outcome, in response to a predicted accuracy 
history signal based on the accuracy history bit. 

2. The branch predictor according to claim 1 , wherein the branch prediction means 
comprises: 

a branch history register for storing conditional branches of previous branch 
instructions; 

a pattern history table for storing pattern history bits used for generating the predicted 
conditional branch corresponding to the conditional branches of the previous branch instructions 
stored in the branch history register; and 

a second state transition logic circuit for generating the pattern history bits in response to 
the real conditional branch of the branch instruction. 

3 . The branch predictor according to claim 2, wherein the second state transition 
logic circuit includes an up/down saturating counter. 

4. The branch predictor according to claim 1 , wherein the accuracy history table 
includes a memory array. 
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5. The branch predictor according to claim 1 , wherein the comparator generates the 
comparison signal having a first logic value when the predicted conditional branch is the same as 
the real conditional branch, and generates the comparison signal having a second logic value 
when the predicted conditional branch is different from the real conditional branch. 

6. The branch predictor according to claim 1 , wherein the first state transition logic 
circuit includes an up/down saturating counter. 

7. The branch predictor according to claim 6, wherein the first state transition logic 
circuit is used after learning the predicted branch accuracy of patterns of previous branch 
instructions. 

8. The branch predictor according to claim 1 , wherein the predicted accuracy signal 
is determined by a most significant bit of the accuracy history bit. 
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BRANCH PREDICTOR USING BRANCH PREDICTION ACCURACY HISTORY 



Abstract of the Disclosure 
A branch predictor outputs either a predicted conditional branch or an inverted predicted 
conditional branch as a final branch prediction outcome, in response to a predicted accuracy 
history signal based on an accuracy history bit. According to the accuracy history bit, it is 
determined whether the branch prediction outcome of the branch predictor is correct. If the 
predicted conditional branch is correct, the branch predictor outputs the predicted conditional 
branch, and if the predicted conditional branch is not correct, the branch predictor outputs the 
inverted predicted conditional branch, in response to the predicted accuracy history signal. For 
performing this process, the branch prediction appends an accuracy history table and a 
multiplexer to a conventional branch predictor, so that the branch prediction according to the 
present invention can reduce the misprediction with relatively simple circuitry and low hardware 
cost. 
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amended by *uy UMndiiHsai ivfbmd » hMn, 

i Kknowf rtge thr «tiiry tn disnUn* ipftmnitlon wWch b imttrU to ** *f ■wKcttton to 

I tmrty a Mm fcitigH Hwty M Wfcr Tto- », Ifattri ***** f 1 » «ny «»«tB* ippiii^^fbt 

mtoAt or in v«ntnt*ft «.ti feitll WTT Iftttmtitftllll ippllMiMBlMl^ 

the United <if li*c4 twfa*r »nd *»** 4* ifoitiiM «y *»W ipfMMfc^fl ^J""™* 

fnv-fibViccftificaic^ .my PCT intenutftooil *«pliiattioo(s) tfW!»t«tni tl tow one eowrtrj attorn™ ^ Waited 

of Aiwio* fil«r<* by m* cn ftibjMl flMrttw Imlug - (King d«* *■* ©f tfi* wpQItEXhMH*} ©T 

which priority » eltiftieri*. 



rttKOit KOHlaGWNrf AmtCAttQNW? AND AKV HttOWTV CLAIMS tiAC. j llgj 



couKmv 



± 



AFPl JCATTON Nt fMfctH j 0A1BOPHUNO 



fWOWTY cLAltifiDWtffift 



3Q:39fcld tttC9I£SK£080e99TT£ =01 t»S99-603-TE0 ©HnSWyS'WOyj IMJ t-T:£T 03-i0t 



00-OCT-E0 13:15 FROM: HANR PfiT SEOUL 



34537634 



TO: 617 4E6 EET5 



FfiGE:003 



fi-ni-T-io 14= 35 TOT SEOUL 34537635 

OCT-t7-2000 TUE as :3d PH SAWIKLS CAUTHIER BTIVEHS 



TO? 031 309 G054 
FAX NO. 8174SB227B 



P. 17/32 



if*' 



UfcSCLARATlON. WITtTlW AMD TOWER OF ATTOVMBY WM. PA1WT 

APPLICATION 



AtfcMMylMkftetN*'. 
BAM-W 



t benby chimthabw^tuntlKrTtlk M, United StlftMCwfe.fi 1 WMOftny United 



TtttOtt UA AirrUfcATlOfW FO» CTMEJTT tffOMt as UJIjC. 1 1 lttri< 

^ucATtfiwiauMtMtit i __rj yjw pJ2ATE 



1 tatty dtfan to tawftt under TWc 35, United Stmt Code. 9 120 of My Vniu* «« 
kimnwdmvl nppltonimii«) dwtootth* flic United S«m of Attm'tsf, fori WH* Ibttd 

»UWfCt tn fitter o> nch oHh* cMftU of thil dpcdlciliuii lstitftd»*tf Mod In fliMftfcOif prior ippllc«tiwri(ij tatlwiwijffliw 
DroWri Hy *• tot pwnrapli ofTMc 33, VnM 9mm CotM 1 12, t »*iw»M y ***** 
nforouttuii *t defined <■ Tide 37. Code <rf ftxfenl EbtfitafeMl. 1 1 .» WWofc hetwt,p " n ""» *™ * r 

4m prior appliaittonR titf <t* tutimt* or PCT HittnafinMl flllitf of this »«>i»«mob: 



wnoM u.s. applications on per iwTERWATitwAi. a^icatton^oesmsnaiingtiikc.s. 

FOR BKNEttT UHUMK » U-fcC * 



APPLICATION NIW1IBR 



DATBOFFtLtW 



STATUS: (PATENTED. PENDING 



Mamie* E.<HHthtor 



Rog, Ne. 

Mfu NO. lijffHl 

RNt No. 18.192 



AfiMM) I- F&wlfi 



StewtiMLMBi, Biq. 



DtimTBtaptar»C*Uito: 

(6tt)4Jo^t80*ltl. 149 



iu>ch«d ipedfkMtatt Md alilnw, and hwrty subscribe my mmm to «M spoeHtatto* md *t*«i» 
fon^e <J»cl«rtltton . f*>*«t Af •ttomoy, *ml AN p«ttiJt*i. 

1 harefry ^mc tW nil *i«4«in«Ati »Mk W^n ofmy «*w Ich«wM|p **» *i4 th* *N*lil«nnll iHW* !M 
infcrmailon tmd beltet «r« tatevwJ KsbtVW. md fpflher Sm tti-e rti&mw*i war* mKi« ^* _^^'T d * < ; 5J? 
wMftil folic slvlMiKitK «ml «» liko «• m*<!<i ««• ptWiA»>te byfimavr jmprUmHiun!. er ho«i, aMMt W™* ™" 
of TiUc 18 of the UtitWd SHWta Co* md UUK NC* wflimi IMic «m»wji» iw J«»|»««w H« °» 
■p)ille«t<on or anr jMium idMtd «>Mr«Mi. J 



SlgnMUK 



_X 0TTO 



Full N«m« of 
Ittlnvcfiibr 


CUtt 


fMtdMRPMM 


imnttiiiMiWiii 




lip wan. wiMl 


HipiMleaflCaMfe 






Mil, fknhwn Ail, 4#a, 
,iMlllJfM!l. 
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